ZMCintegral-v5.1: Support for multi-function integrations on GPUs

نویسندگان

چکیده

In this new version of ZMCintegral, we have added the functionality multi-function integrations, i.e. ability to integrate more than 103 different functions on GPUs. The Python API remains similar as previous versions. For integrands less 5 dimensions, it usually takes 10 minutes finish evaluation integrations one Tesla v100 card. performance scales linearly with increasing Program Title: ZMCintegral CPC Library link program files: https://doi.org/10.17632/p7wc7k6mpp.3 Licensing provisions: Apache-2.0 Programming language: Journal reference version: Hong-Zhong Wu, Jun-Jie Zhang, Long-Gang Pang, Qun Wang, Comput. Phys. Commun. 248 (2020) 106962 and 251 107240 Does supersede version?: Yes Reasons for When solving Boltzmann equation radiations [1], encounters collision integrals energy beams. relativistic QED plasma, terms involve various Feynman graphs [2] contribution from each graph is great interest. these circumstances needs many forms simultaneously. our versions [3,4], focused single integration high dimensions parameters. Therefore, necessary include integrating which domains. Summary revisions: • Multi-function Suppose a series defined as(1)fn(x)=ancos(kn⋅x)+bnsin(kn⋅x), where n=1,2,3,...,100. above can be treated set Harmonic bases if wishes evaluate mode. versions, cannot manipulated in convenient efficient way. It worth noting that domains or different, example(2)gn(x1,x2)=an|x1+x2|for 0<n<50gn(x1,x2,x3)=bn|x1+x2−x3|for 50≤n≤100. support multi-functions gives users full flexibility possible. Test GPUs As an illustrative example, report solution Eq. (1) x=(x1,x2,x3,x4), an=bn=1. ranges all components are taken [0,1] kn=(n+502π,n+502π,n+502π,n+502π) such highly fluctuating around zero line. hardware condition case be: Intel(R) Xeon(R) Silver 4110 [email protected] CPU processors + Nvidia V100 GPU. results shown Fig. 1. Nature problem: easy use package doing dimensional distributed GPU clusters. Using libraries Numba [5] Ray [6], well NVIDIA CUDA [7] capability, offers succinct interface numerical physical problems. updated version, mainly focus problems integrate. These take Solution method: This contains three classes. ZMCintegral_normal utilizes stratified-sampling heuristic-tree-search techniques, while ZMCintegral_functional ZMCintegral_multifunctions direct-Monte Carlo method integrand benefits heavily Additional comments including restrictions unusual features: If high-dimensional (e.g. dimensionality 8-12), encouraged ZMCintegral_normal. middle-dimensional 1-7) but large parameter space, suggest try ZMCintegral_functional. contain 104 integrations), then suggested. detailed instructions found here: [8]. J. Oxenius, Kinetic Theory Particles Photons, Springer Berlin Heidelberg, 1986, https://dx.doi.org/10.1007/978-3-642-70728-5. V. Morozov, G. Röpke, theory radiation non-equilibrium plasmas, Ann. 324 (6) (2009) 1261–1302, https://dx.doi.org/10.1016/j.aop.2009.02.001. H.-Z. J.-J. L.-G. Q. ZMCintegral: A multi-dimensional Monte multi-GPUs, (2019) https://dx.doi.org/10.1016/j.cpc.2019.106962. ZMCintegral-v5: Support scanning space 107240, https://dx.doi.org/10.1016/j.cpc.2020.107240. S. Kwan Lam, A. Pitrou, Seibert, Numba: llvm-based python jit compiler, 2015, pp. 1–6, https://dx.doi.org/10.1145/2833157.2833162. P. Moritz, R. Nishihara, Tumanov, Liaw, E. Liang, M. Elibol, Z. Yang, W. Paul, M.I. Jordan, I. Stoica, Ray: framework emerging AI applications, in: 13th USENIX Symposium Operating Systems Design Implementation (OSDI 18), Association, Carlsbad, CA, 2018, 561–577, https://www.usenix.org/conference/osdi18/presentation/moritz. Buck, T. Purcell, Toolkit Computation GPUs, 2004. https://github.com/Letianwu/ZMCintegral.git.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Architectural Support for Address Translation on GPUs

The proliferation of heterogeneous compute platforms, of which CPU/GPU is a prevalent example, necessitates a manageable programming model to ensure widespread adoption. A key component of this is a shared unified address space between the heterogeneous units to obtain the programmability benefits of virtual memory. Indeed, processor vendors have already begun embracing heterogeneous systems wi...

متن کامل

Multi-Elimination ILU Preconditioners on GPUs

Iterative solvers for sparse linear systems often benefit from using preconditioners. While there are implementations for many iterative methods that leverage the computing power of accelerators, porting the latest developments in preconditioners to accelerators has been challenging. In this paper we develop a selfadaptive multi-elimination preconditioner for graphics processing units (GPUs). T...

متن کامل

compactifications and function spaces on weighted semigruops

chapter one is devoted to a moderate discussion on preliminaries, according to our requirements. chapter two which is based on our work in (24) is devoted introducting weighted semigroups (s, w), and studying some famous function spaces on them, especially the relations between go (s, w) and other function speces are invesigated. in fact this chapter is a complement to (32). one of the main fea...

15 صفحه اول

Multi-scale Unbiased Diffeomorphic Atlas Construction on Multi-GPUs

In this chapter, we present a high performance multi-scale 3D image processing framework to exploit the parallel processing power of multiple graphic processing units (Multi-GPUs) for medical image analysis. We developed GPU algorithms and data structures that can be applied to a wide range of 3D image processing applications and efficiently exploit the computational power and massive bandwidth...

متن کامل

Multiclass Classification using Support Vector Machines on GPUs

The scaling of serial algorithms cannot rely on the improvement of CPUs anymore. The performance of classical Support Vector Machine (SVM) implementations has reached its limit and the arrival of the multi core era requires these algorithms to adapt to a new parallel scenario. Graphics Processing Units (GPU) have arisen as high performance platforms to implement data parallel algorithms. In thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computer Physics Communications

سال: 2021

ISSN: ['1879-2944', '0010-4655']

DOI: https://doi.org/10.1016/j.cpc.2021.107994